APPLICATION UNDER UNITED STATES PATENT LAWS 



Atty. Dkt.No. PM 277090 

(M#) 

Invention: METHODS 

Inventor (s): ANAND, Rakesh 
SMITH, John C. 
MORTEN, John E N. 




Pillsbury Winthrop LLP
Intellectual Property Group
1100 New York Avenue, NW
Ninth Floor 

Washington, DC 20005-3918 

Attorneys 
Telephone: (202) 861-3000 



This is a: 

□ Provisional Application 
☒ Regular Utility Application

□ Continuing Application 

□ The contents of the parent are incorporated 
by reference 

□ PCT National Phase Application 

□ Design Application 
□ Reissue Application

□ Plant Application 

□ Substitute Specification 

Sub. Spec Filed 

In App. No.



□ Marked up Specification re 

Sub. Spec, filed 

In App. No.



SPECIFICATION 






SUBSTANTIVE SPECIFICATION



AstraZeneca Case No. 70667 I C& 
Specification for use in: OS/^ 
Claiming priority from UK Patent 
Application No. GB 0003553.5 
Filed on 17th February 2000
and Application No. GB 0008376.6
Filed on 6th April 2000

Inventors: 

ANAND, Rakesh, a Company Research
Associate

MORTEN, John Edward Norris, a
Research Scientist

and SMITH, John Craig, a Team Leader;
all of Alderley Park, Macclesfield,
Cheshire, GB-SK10 4TG;

TITLE: 

METHODS 

APPLICANT: 

AstraZeneca AB, 

S-151 85 Sodertalje, Sweden. 



Z70667 




METHODS 

This invention relates to polymorphisms in the human prostaglandin E2 receptor 1
(EP1-R) gene and to corresponding novel allelic peptides encoded thereby. The invention
also relates to methods and materials for analysing allelic variation in the EP1-R gene, and to
the use of EP1-R polymorphism in the diagnosis and treatment of diseases in which
modulation of EP1-R activity could be of therapeutic benefit, particularly disease states
associated with pain such as rheumatoid arthritis, osteoarthritis and osteoporosis.

Prostaglandins (prostaglandin D2, E2, F2 alpha and I2) and thromboxane A2 (TXA2)
are members of a family of hormones termed prostanoids, formed during the metabolism of
arachidonic acid by cyclooxygenases. Prostanoids can be produced by many tissues and cells
in response to a variety of stimuli, show a wide range of effects and are involved in regulation
of many biological functions. They display a broad spectrum of biological properties which
include contraction and relaxation of smooth muscle (including blood vessels, bronchi, uterus,
gastrointestinal tract), inhibition of gastric acid secretion, and effects on platelet aggregation
and endocrine and metabolic processes.

The biological actions of prostanoids are mediated through specific G-protein coupled
cell surface receptors. The EP1 receptor (EP1-R) is one of four receptor subtypes termed
EP1, EP2, EP3 and EP4, which mediate the biological activity of prostaglandin E2 (PGE2)
(Negishi M et al., J Lipid Mediators Cell Signalling 12, 379-391, 1995). Each of the receptor
subtypes possesses a distinct physiological role; each binds PGE2 with high affinity but
displays differences in binding affinity for various PGE2 agonists and antagonists, and each
mediates its effects via different signal transduction pathways (Coleman R et al.,
Pharmacological Reviews 46, 205-229, 1994).

The EP1 receptor has been located on a wide range of tissues including stomach, small
intestine, kidney, eye, uterus, trachea and muscle, and has been specifically related to
induction of pain (Syriatowicz J et al., Neuroscience 94, 587-594, 1999), fever (Oka K et al.,
Am J Physiol 275/6 44-46, 1998), diuresis and osmoregulation (Breyer et al., Current Opinion
in Nephrology and Hypertension 9/1, 23-29, 2000), initiation of labour (Spaziani et al.,
Biology of Reproduction 62/1, 23-26, 2000), and colon carcinogenesis (Watanabe et al.,
Cancer Research 59/20 5093-5096, 1999).

Drugs which change the level of an EP1-R mediated response or change the biological
activity of the EP1-R are useful in the treatment of all conditions in which the EP1-R plays a
pathophysiological role. Such drugs are particularly useful in the treatment of conditions
associated with pain, for example the pain associated with joint conditions (such as
rheumatoid arthritis and osteoarthritis), post-operative pain, post-partum pain, the pain
associated with dental conditions (such as dental caries and gingivitis), the pain associated
with burns (including sunburn), the treatment of bone disorders (such as osteoporosis,
hypercalcaemia of malignancy and Paget's disease), and the pain associated with sports
injuries and sprains.

Examples of EP1-R drugs are given in WO 97/00864 (Zeneca Ltd.) and WO 96/03380
(Zeneca Ltd.).

A cDNA sequence encoding EP1-R has been cloned by Funk et al., J. Biol. Chem.
268, 26767-26772, 1993. The cDNA sequence has been submitted to the EMBL database
under accession number L22647. The chromosomal location of the EP1-R gene has been
mapped to 19p13.1 (Duncan et al., Genomics 25, 740-742, 1995).

The genomic DNA sequence of the human EP1-R gene has been published in the
EMBLNEW database under accession number AC008569. The database entry incorrectly
assigns the chromosomal location of the human EP1-R to chromosome 16.

Fragments of EP1-R genomic DNA sequence corresponding to positions 1-374,
732-2274 and 2276-3908 of SEQ ID NO. 1 have been published under EMBL accession number
AC008569. This database entry confirms the chromosomal location of the human EP1-R
gene on chromosome 19.

We have determined the full length genomic DNA sequence of the human EP1-R
gene, disclosed in SEQ ID NO. 1 and shown in diagrammatic form in Figure 1. By comparing
the genomic sequence of SEQ ID NO. 1 with the cDNA sequence of L22647, we have
identified some of the structural features of the EP1-R gene. There is an untranslated first
exon at the 5' end of the gene. Exon 1 positions 1074-1130 in SEQ ID NO. 1 correspond to
positions 1-57 as defined in EMBL accession number L22647. There is a first intron at
positions 1131-2054 of SEQ ID NO. 1 followed by exon 2 at positions 2055-3013 of SEQ ID
NO. 1. The sequence of exon 2 corresponds to sequence positions 58-1016 as defined in
EMBL accession number L22647. Exon 2 contains the initiating ATG codon at positions
2072-2074 of SEQ ID NO. 1, corresponding to positions 75-78 as defined in EMBL accession
number L22647. A second intron spans positions 3014-3565 of SEQ ID NO. 1. Exon 3 is
found at positions 3566-3908 of SEQ ID NO. 1, corresponding to positions 1017-1359 as
defined in EMBL accession number L22647. The intron/exon boundaries disclosed in SEQ
ID NO. 1 are consistent with the AG/GT consensus sequence. A comparison of the sequences
disclosed in SEQ ID NO. 1 and EMBL accession number L22647 is shown in the Examples.
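
By way of illustration only, the exon correspondences stated above can be expressed as a
simple coordinate lookup. The following Python sketch uses the exon boundaries given in the
text; the names EXONS and genomic_to_cdna are hypothetical and are introduced solely for
this example.

```python
# Exon coordinates as stated in the text: (start, end) in SEQ ID NO. 1,
# paired with the first corresponding position in EMBL entry L22647.
EXONS = [
    (1074, 1130, 1),     # exon 1 -> L22647 positions 1-57
    (2055, 3013, 58),    # exon 2 -> L22647 positions 58-1016
    (3566, 3908, 1017),  # exon 3 -> L22647 positions 1017-1359
]


def genomic_to_cdna(pos):
    """Return the L22647 position for a SEQ ID NO. 1 position.

    Returns None when the position lies outside the three exons
    (that is, in the 5' flanking region or in an intron).
    """
    for start, end, cdna_start in EXONS:
        if start <= pos <= end:
            return cdna_start + (pos - start)
    return None


# Exon boundaries and one intronic position (1136, within intron 1).
for p in (1074, 1130, 2055, 3013, 3566, 3908, 1136):
    print(p, genomic_to_cdna(p))
```

A position for which the function returns None, such as 1136, has no counterpart in the
cDNA sequence because it lies within an intron or upstream of exon 1.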

All positions in the human EP1-R gene herein refer to the positions in SEQ ID NO. 1
unless stated otherwise or apparent from the context.

DNA polymorphisms are variations in DNA sequence between one individual and
another. DNA polymorphisms may lead to variations in amino acid sequence and
consequently to altered protein structure and functional activity. Polymorphisms may also
affect mRNA synthesis, maturation, transportation and stability. Polymorphisms which do
not result in amino acid changes (silent polymorphisms) or which do not alter any known
consensus sequences may nevertheless have a biological effect, for example by altering
mRNA folding, stability, splicing, transcription rate, translation rate, or fidelity. Recently, it
has been reported that even polymorphisms that do not result in an amino acid change can
cause different structural folds of mRNA with potentially different biological functions (Shen
et al. (1999), Proc Natl Acad Sci USA 96:7871-7876).

WO 00/29614 (Eurona Medical Labs) - published after the priority dates of the present
application - also discloses the identification of various polymorphisms in the EP1-R gene.
Based on the published EP1-R sequence, their analysis predicts the existence of eight EP1-R
polymorphisms. The present analysis has identified probable errors in the original published
sequence for EP1-R as all individuals analysed differed from the published sequence at
nucleotides 285, 763 and 764 of L22647. This suggests that the predicted polymorphisms at
positions 211, 689 and 690 in WO 00/29614 reflect the existence of errors in the original
published sequence for EP1-R gene rather than true polymorphisms.
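
The correspondence between the two sets of positions quoted above can be checked by simple
arithmetic. The sketch below is illustrative only and the function name constant_offset is
hypothetical. It shows that each pair of positions differs by the same amount, which would
be consistent with (although the text does not state) a numbering in WO 00/29614 that begins
at the initiating codon, located at position 75 of L22647.

```python
def constant_offset(first, second):
    """Return the common difference between paired positions.

    Returns None if the pairs do not all differ by the same amount.
    """
    differences = {a - b for a, b in zip(first, second)}
    return differences.pop() if len(differences) == 1 else None


l22647_positions = [285, 763, 764]  # positions in L22647, from the text
wo_positions = [211, 689, 690]      # positions quoted from WO 00/29614

print(constant_offset(l22647_positions, wo_positions))
```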

Knowledge of polymorphisms may be used to help identify patients most suited to
therapy with particular pharmaceutical agents (this is often termed "pharmacogenetics").
Pharmacogenetics can also be used in pharmaceutical research to assist the drug selection
process. Polymorphisms may be used in mapping the human genome and to elucidate the
genetic component of diseases. The reader is directed to the following references for
background details on pharmacogenetics and other uses of polymorphism detection: Linder et
al. (1997), Clinical Chemistry, 43, 254; Marshall (1997), Nature Biotechnology, 15, 1249;
International Patent Application WO 97/40462, Spectra Biomedical; and Schafer et al. (1998),
Nature Biotechnology, 16, 33.

Clinical trials have shown that patient response to treatment with pharmaceuticals is
often heterogeneous. Thus there is a need for improved approaches to pharmaceutical agent
design and therapy.

The present invention is based on the discovery of fourteen polymorphisms in the
human EP1-R gene.

According to a first aspect of the present invention there is provided a method for the
diagnosis of a polymorphism in an EP1-R gene in a human, which method comprises
determining the sequence of the nucleic acid of the human at one or more of positions 344,
621-627, 793-799, 908, 1136, 1160, 1189, 1458, 1656, 2448, 2531, 3348, 3432 and 3622 in
the EP1-R gene as defined by the positions in SEQ ID NO. 1; and determining the status of the
human by reference to polymorphism in the EP1-R gene.

The term human includes both a human having or suspected of having an EP1-R
mediated disease and an asymptomatic human who may be tested for predisposition or
susceptibility to such disease. At each position the human may be homozygous for an allele
or the human may be a heterozygote.

The term 'EP1-R mediated disease' means any disease in which changing the level of
an EP1-R mediated response or changing the biological activity of the EP1-R would be of
therapeutic benefit.

The term 'EP1-R drug' means any drug which changes the level of an EP1-R mediated
response or changes the biological activity of the EP1-R. For example the drug may be an
agonist or an antagonist of a natural ligand for the EP1-R. A drug which inhibits the activity
of the EP1-R is preferred.

Variations in polypeptide sequence will be referred to as follows: original amino acid
(using one or three letter nomenclature), position, new amino acid. For a hypothetical
example "D25K" or "Asp25Lys" means that at position 25 an aspartic acid has been changed
to lysine.
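
The nomenclature described above lends itself to mechanical parsing. The following Python
sketch is illustrative only: the names THREE_TO_ONE and parse_variant are hypothetical, and
the sketch assumes the twenty standard amino acids and a variant name written without
spaces.

```python
import re

# Standard three letter to one letter amino acid codes.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}

# Original amino acid, position, new amino acid; each amino acid may be
# given in one letter or three letter form.
_PATTERN = re.compile(r"([A-Z][a-z]{2}|[A-Z])(\d+)([A-Z][a-z]{2}|[A-Z])")


def parse_variant(name):
    """Return (original, position, new) using one letter codes."""
    match = _PATTERN.fullmatch(name)
    if match is None:
        raise ValueError(f"unrecognised variant name: {name!r}")
    original, position, new = match.groups()

    def to_one(code):
        return THREE_TO_ONE[code] if len(code) == 3 else code

    return to_one(original), int(position), to_one(new)


print(parse_variant("D25K"))
print(parse_variant("Asp25Lys"))
```

Both forms of the hypothetical example yield the same result, namely an aspartic acid at
position 25 changed to lysine.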

The term polymorphism includes single nucleotide substitution, nucleotide insertion
and nucleotide deletion, which in the case of insertion and deletion includes insertion or
deletion of one or more nucleotides at a position of a gene and variable numbers of a repeated
DNA sequence.

In one embodiment of the invention preferably the method for diagnosis described
herein is one in which the single nucleotide polymorphism at position 344 in the EP1-R gene
as defined by the positions in SEQ ID NO. 1 is presence of G and/or A.

In another embodiment of the invention preferably the method for diagnosis described
herein is one in which the polymorphism at position 621-627 in the EP1-R gene as defined by
the positions in SEQ ID NO. 1 is deletion of one or more of the seven Gs. Preferably allelic
variation consists of a single nucleotide deletion of one of the seven Gs.

In another embodiment of the invention preferably the method for diagnosis described
herein is one in which the polymorphism at position 793-799 in the EP1-R gene as defined by
the positions in SEQ ID NO. 1 is deletion of one or more of the seven Gs. Preferably allelic
variation consists of a single nucleotide deletion of one of the seven Gs.

In another embodiment of the invention preferably the method for diagnosis described
herein is one in which the polymorphism at position 908 in the EP1-R gene as defined by the
positions in SEQ ID NO. 1 is presence of C and/or T.

In another embodiment of the invention preferably the method for diagnosis described
herein is one in which the polymorphism at position 1136 in the EP1-R gene as defined by the
positions in SEQ ID NO. 1 is presence of G and/or C.

In another embodiment of the invention preferably the method for diagnosis described
herein is one in which the polymorphism at position 1160 in the EP1-R gene as defined by the
positions in SEQ ID NO. 1 is presence of T and/or C.

In another embodiment of the invention preferably the method for diagnosis described
herein is one in which the polymorphism at position 1189 in the EP1-R gene as defined by the
positions in SEQ ID NO. 1 is presence of G and/or A.

In another embodiment of the invention preferably the method for diagnosis described
herein is one in which the polymorphism at position 1458 in the EP1-R gene as defined by the
positions in SEQ ID NO. 1 is presence of A and/or G.

In another embodiment of the invention preferably the method for diagnosis described
herein is one in which the polymorphism at position 1656 in the EP1-R gene as defined by the
positions in SEQ ID NO. 1 is presence of T and/or G.

In another embodiment of the invention preferably the method for diagnosis described
herein is one in which the polymorphism at position 2448 in the EP1-R gene as defined by the
positions in SEQ ID NO. 1 is presence of T and/or C.

In another embodiment of the invention preferably the method for diagnosis described
herein is one in which the polymorphism at position 2531 in the EP1-R gene as defined by the
positions in SEQ ID NO. 1 is presence of G and/or A.

In another embodiment of the invention preferably the method for diagnosis described
herein is one in which the polymorphism at position 3348 in the EP1-R gene as defined by the
positions in SEQ ID NO. 1 is presence of C and/or T.

In another embodiment of the invention preferably the method for diagnosis described
herein is one in which the polymorphism at position 3432 in the EP1-R gene as defined by the
positions in SEQ ID NO. 1 is presence of C and/or G.

In another embodiment of the invention preferably the method for diagnosis described
herein is one in which the polymorphism at position 3622 in the EP1-R gene as defined by the
positions in SEQ ID NO. 1 is presence of G and/or A.

In another embodiment of the invention preferably the method for diagnosis described
herein is one selected from the group in which, as defined by the positions in SEQ ID NO. 1:
the polymorphism at position 344 in the EP1-R gene is presence of G and/or A, the
polymorphism at position 621-627 in the EP1-R gene is deletion of one or more of the seven
Gs, the polymorphism at position 793-799 in the EP1-R gene is deletion of one or more of the
seven Gs, the polymorphism at position 908 in the EP1-R gene is presence of C and/or T, the
polymorphism at position 1136 in the EP1-R gene is presence of G and/or C, the
polymorphism at position 1160 in the EP1-R gene is presence of T and/or C, the
polymorphism at position 1189 in the EP1-R gene is presence of G and/or A, the
polymorphism at position 1458 in the EP1-R gene is presence of A and/or G, the
polymorphism at position 1656 in the EP1-R gene is presence of T and/or G, the
polymorphism at position 2448 in the EP1-R gene is presence of T and/or C, the
polymorphism at position 2531 in the EP1-R gene is presence of G and/or A, the
polymorphism at position 3348 in the EP1-R gene is presence of C and/or T, the
polymorphism at position 3432 in the EP1-R gene is presence of C and/or G and, the
polymorphism at position 3622 in the EP1-R gene is presence of G and/or A.

It will be appreciated by the person skilled in the art that the numbering of the
nucleotide positions in the EP1-R gene will vary according to the number of deletions at
positions 621-627 and 793-799 as defined by the sequence in SEQ ID NO. 1. For example, in
a first allele comprising 7xG nucleotides at positions 621-627 of SEQ ID NO. 1 the nucleotide
sequence immediately following the 7xG will start at position 628. In a second allele where
one of the Gs at positions 621-627 of SEQ ID NO. 1 is deleted, the nucleotide sequence
immediately following the remaining 6xGs will now start at position number 627 and so
forth.
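
The renumbering described above can be sketched as follows. This Python example is
illustrative only: the names POLY_G_TRACTS and allele_position are hypothetical, and the
sketch deliberately declines to renumber positions lying inside a shortened G tract, since
the text does not specify which of the seven Gs is taken to be deleted.

```python
# The two runs of seven Gs, as numbered in SEQ ID NO. 1.
POLY_G_TRACTS = {"621-627": (621, 627), "793-799": (793, 799)}


def allele_position(ref_pos, deleted):
    """Convert a SEQ ID NO. 1 position to its position in a given allele.

    `deleted` maps a tract name to the number of Gs deleted from that
    tract in the allele, for example {"621-627": 1}.
    """
    shift = 0
    for name, (start, end) in POLY_G_TRACTS.items():
        n = deleted.get(name, 0)
        if not 0 <= n <= end - start + 1:
            raise ValueError(f"invalid deletion count for tract {name}")
        if n and start <= ref_pos <= end:
            raise ValueError("position lies inside a shortened G tract")
        if ref_pos > end:
            shift += n  # every base downstream of the tract moves up by n
    return ref_pos - shift


print(allele_position(628, {}))                # first allele, 7xG
print(allele_position(628, {"621-627": 1}))    # second allele, 6xG
```

With no deletion the sequence following the tract starts at position 628, and with one G
deleted it starts at position 627, in agreement with the example given in the text.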

The method for diagnosis is preferably one in which the sequence is determined by a
method selected from amplification refractory mutation system (ARMS™-allele specific
amplification), allele specific hybridisation (ASH), oligonucleotide ligation assay (OLA) and
restriction fragment length polymorphism (RFLP). Where the method for diagnosis is based
on amino acid sequence, the sequence is preferably determined by immunological methods
such as enzyme linked immunosorbent assay (ELISA).

In another aspect of the invention there is provided a method of analysing a nucleic
acid, comprising: obtaining a nucleic acid from an individual; and determining the base
occupying any one of the following polymorphic sites: 344, 621-627, 793-799, 908, 1136,
1160, 1189, 1458, 1656, 2448, 2531, 3348, 3432 and 3622 in the EP1-R gene as defined by
the positions in SEQ ID NO. 1.

In another aspect of the invention we provide a method for the diagnosis of
EP1-R-mediated disease, which method comprises:

i) obtaining sample nucleic acid from an individual,

ii) detecting the presence or absence of a variant nucleotide at one or more of positions 344,
621-627, 793-799, 908, 1136, 1160, 1189, 1458, 1656, 2448, 2531, 3348, 3432 and 3622
in the EP1-R gene as defined by the positions in SEQ ID NO. 1; and

iii) determining the status of the individual by reference to polymorphism in the EP1-R gene.

Allelic variation at each position in the EP1-R gene, including preferred variation, is
described herein.

The status of the individual may be determined by reference to allelic variation at any
one, two, three, four, five, six, seven, eight, nine, ten, eleven, twelve, thirteen or all fourteen
positions optionally in combination with any other polymorphism in the gene that is (or
becomes) known.

The test sample of nucleic acid is conveniently present in a sample of blood, sputum,
skin, bronchoalveolar lavage fluid, or other body fluid or tissue obtained from an individual.
It will be appreciated that the test sample may equally comprise a nucleic acid sequence
corresponding to the sequence in the test sample, that is to say that all or a part of the region
in the sample nucleic acid may firstly be amplified using any convenient technique, e.g. PCR,
before analysis of allelic variation.

It will be apparent to the person skilled in the art that there are a large number of
analytical procedures which may be used to detect the presence or absence of variant
nucleotides at one or more polymorphic positions of the invention. In general, the detection
of allelic variation requires a mutation discrimination technique, optionally an amplification
reaction and optionally a signal generation system. Table 1 lists a number of mutation
detection techniques, some based on PCR. These may be used in combination with a number
of signal generation systems, a selection of which is listed in Table 2. Further amplification
techniques are listed in Table 3. Many current methods for the detection of allelic variation
are reviewed by Nollau et al., Clin. Chem. 43, 1114-1120, 1997; and in standard textbooks,
for example "Laboratory Protocols for Mutation Detection", Ed. by U. Landegren, Oxford
University Press, 1996 and "PCR", 2nd Edition by Newton & Graham, BIOS Scientific
Publishers Limited, 1997.



Abbreviations:

ALEX™     Amplification refractory mutation system linear extension
APEX      Arrayed primer extension
ARMS™     Amplification refractory mutation system
ASA       Allele specific amplification
b-DNA     Branched DNA
CMC       Chemical mismatch cleavage
COPS      Competitive oligonucleotide priming system
DGGE      Denaturing gradient gel electrophoresis
FRET      Fluorescence resonance energy transfer
LCR       Ligase chain reaction
MASDA     Multiple allele specific diagnostic assay
NASBA     Nucleic acid sequence based amplification
OLA       Oligonucleotide ligation assay
PCR       Polymerase chain reaction
PTT       Protein truncation test
RFLP      Restriction fragment length polymorphism
SDA       Strand displacement amplification
SERRS     Surface enhanced Raman resonance spectroscopy
SNP       Single nucleotide polymorphism
SSCP      Single-strand conformation polymorphism analysis
SSR       Self sustained replication
TGGE      Temperature gradient gel electrophoresis



Table 1 - Mutation Detection Techniques

General: DNA sequencing, Sequencing by hybridisation

Scanning: PTT*, SSCP, DGGE, TGGE, Cleavase, Heteroduplex analysis, CMC, Enzymatic
mismatch cleavage

* Note: not useful for detection of promoter polymorphisms.

Hybridisation Based:

Solid phase hybridisation: Dot blots, MASDA, Reverse dot blots, Oligonucleotide arrays
(DNA Chips).

Solution phase hybridisation: Taqman™ - US-5210015 & US-5487972 (Hoffmann-La
Roche), Molecular Beacons - Tyagi et al. (1996), Nature Biotechnology, 14, 303; WO
95/13399 (Public Health Inst., New York).

Extension Based: ARMS™-allele specific amplification (as described in European patent
No. EP-B-332435 and US patent No. 5,595,890), ALEX™ - European Patent No. EP 332435
B1 (Zeneca Limited), COPS - Gibbs et al. (1989), Nucleic Acids Research, 17, 2347.

Incorporation Based: Mini-sequencing, APEX

Restriction Enzyme Based: RFLP, Restriction site generating PCR

Ligation Based: OLA

Other: Invader assay

Table 2 - Signal Generation or Detection Systems

Fluorescence: FRET, Fluorescence quenching, Fluorescence polarisation - United Kingdom
Patent No. 2228998 (Zeneca Limited)

Other: Chemiluminescence, Electrochemiluminescence, Raman, Radioactivity, Colorimetric,
Hybridisation protection assay, Mass spectrometry and SERRS - WO 97/05280 (University of
Strathclyde).

Table 3 - Further Amplification Methods

SSR, NASBA, LCR, SDA, b-DNA

Preferred mutation detection techniques include ARMS™-ASA, ALEX™, COPS,
Taqman, Molecular Beacons, RFLP, restriction site based PCR and FRET techniques,
polyacrylamide gel electrophoresis and capillary electrophoresis.

Particularly preferred methods include ARMS™-ASA and RFLP based methods.
ARMS™-ASA is an especially preferred method.

ARMS™-allele specific amplification (described in European patent No. EP-B-332435,
US patent No. 5,595,890 and Newton et al. (Nucleic Acids Research, Vol. 17, p.2503; 1989))
relies on the complementarity of the 3' terminal nucleotide of the primer and its template,
the 3' terminal nucleotide of the primer being either complementary or non-complementary
to the specific mutation, allele or polymorphism to be detected. There is a selective advantage
for primer extension from the primer whose 3' terminal nucleotide complements the base
mutation, allele or polymorphism. A 3' terminal mismatch between a primer and the
template sequence severely inhibits or prevents enzymatic primer extension. Polymerase chain
reaction or unidirectional primer extension reactions therefore result in product amplification
when the 3' terminal nucleotide of the primer complements that of the template, but not, or at
least not efficiently, when the 3' terminal nucleotide does not complement that of the
template.
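
The principle of 3' terminal discrimination can be illustrated in idealised form. The
following Python sketch is not a primer design method: the primer sequences are hypothetical
and are not taken from the Examples, the names COMPLEMENT and expected_amplification are
introduced for illustration only, and the sketch assumes that extension occurs if, and only
if, the 3' terminal nucleotide of the primer is complementary to the template base.

```python
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def expected_amplification(primer, template_bases):
    """Idealised ARMS outcome for one allele specific primer.

    `primer` is written 5' to 3'. `template_bases` holds the template
    base(s) present at the polymorphic site in the individual: one base
    for a homozygote, two for a heterozygote. Returns True when the 3'
    terminal nucleotide of the primer complements at least one of them.
    """
    three_prime = primer[-1]
    return any(COMPLEMENT[three_prime] == base for base in template_bases)


# Hypothetical primers differing only at the 3' terminal nucleotide.
primer_for_g = "ACGTTGCAC"  # 3' C pairs with template G
primer_for_a = "ACGTTGCAT"  # 3' T pairs with template A

for label, bases in (("G/G", {"G"}), ("G/A", {"G", "A"}), ("A/A", {"A"})):
    print(label,
          expected_amplification(primer_for_g, bases),
          expected_amplification(primer_for_a, bases))
```

In this idealised scheme a homozygote yields product with only one of the two primers,
whereas a heterozygote yields product with both.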

In a further aspect, the diagnostic methods of the invention are used to assess the
efficacy of therapeutic compounds in the treatment of EP1-R-mediated diseases, particularly
disease states associated with pain such as rheumatoid arthritis, osteoarthritis and
osteoporosis.

Assays, for example reporter-based assays, may be devised to detect whether one or 
more of the above polymorphisms affect transcription levels and/or message stability. 

The polymorphisms identified in the present invention that occur in intron regions or
in the promoter region are not expected to alter the amino acid sequence of the EP1 receptor,
but may affect the transcription and/or message stability of the sequences and thus affect the
level of the receptors in cells.

Two of the polymorphisms of the present invention result in variations in amino acid
sequence in the translated protein. Polymorphism at position 2448 as defined in SEQ ID
NO. 1 results in an amino acid change from leucine to proline at corresponding position 126 of
the translated protein (Leu126Pro). Polymorphism at position 2531 as defined in SEQ ID
NO. 1 results in an amino acid change from alanine to threonine at corresponding position 154
of the translated protein (Ala154Thr). Changes in protein sequence may be detected using
standard techniques known to the person skilled in the art such as immunoassay techniques
employing specific antibodies capable of discriminating peptide regions that differ by one
or more amino acids. Detection of the amino acid changing polymorphisms also forms part of
this invention.
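
The relationship between the two nucleotide positions and the affected amino acid positions
can be verified by counting codons from the initiating ATG. The following Python sketch is
illustrative only; the names ATG_START, EXON2_END and codon_of are hypothetical. It relies
on the coordinates given earlier in the text (ATG at positions 2072-2074 and exon 2 ending
at position 3013 of SEQ ID NO. 1) and applies only to positions within exon 2, where no
intron intervenes.

```python
ATG_START = 2072   # first base of the initiating ATG in SEQ ID NO. 1
EXON2_END = 3013   # last base of exon 2 in SEQ ID NO. 1


def codon_of(genomic_pos):
    """Return (codon number, base within codon) for an exon 2 position.

    Both values are 1-based. Only positions between the initiating ATG
    and the end of exon 2 are handled.
    """
    if not ATG_START <= genomic_pos <= EXON2_END:
        raise ValueError("position is outside the coding part of exon 2")
    offset = genomic_pos - ATG_START
    return offset // 3 + 1, offset % 3 + 1


print(codon_of(2448))  # second base of codon 126
print(codon_of(2531))  # first base of codon 154
```

Position 2448 falls on the second base of codon 126 and position 2531 on the first base of
codon 154. This agrees with the stated changes under the standard genetic code: a T to C
change at the second base converts a leucine codon (CTN) to a proline codon (CCN), and a G
to A change at the first base converts an alanine codon (GCN) to a threonine codon (ACN).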

Thus, according to a further aspect of the invention there is provided a method for
detecting the presence of a polymorphism in the EP1-R protein in a human, which method
comprises determining the amino acid present at one or both of amino acid positions 126 and
154 of the EP1-R protein; and determining the status of the individual by reference to the
amino acid detected.

In another aspect of the invention there is provided a method for the diagnosis of
EP1-R-mediated disease, which method comprises:

i) obtaining a protein containing sample from an individual;

ii) detecting the presence or absence of a variant EP1-R polypeptide on the basis of the
presence of a polymorphic amino acid at either or both amino acid positions: 126 and 154;
and,

iii) determining the status of the human by reference to the presence or absence of a
polymorphism in EP1-R protein.

In a preferred embodiment the polymorphic amino acid at position 126 is presence of
proline and at position 154 is presence of threonine.

Individuals who carry particular allelic variants of the EP1-R gene may exhibit
differences in their ability to regulate protein biosynthesis under different physiological
conditions and may display altered abilities to react to different diseases. In addition,
differences in protein regulation and/or the protein's properties arising as a result of allelic
variation may have a direct effect on the response of an individual to drug therapy. The
diagnostic methods of the invention may be useful both to predict the clinical response to such
agents and to determine therapeutic dose.

In a further aspect, the diagnostic methods of the invention are used to assess the
predisposition of an individual to diseases mediated by EP1-R. This may be particularly
relevant in the development of pain and in diseases which are mediated by EP1-R. The
present invention may be used to recognise individuals who are particularly at risk from
developing these conditions.

Low frequency polymorphisms may be particularly useful for haplotyping as
described below. A haplotype is a set of alleles found at linked polymorphic sites (such as
within a gene) on a single (paternal or maternal) chromosome. If recombination within the
gene is random, there may be as many as 2^n haplotypes, where 2 is the number of alleles at
each polymorphic position and n is the number of polymorphic positions. One approach to
identifying mutations or polymorphisms which are correlated with clinical response is to carry
out an association study using all the haplotypes that can be identified in the population of
interest. The frequency of each haplotype is limited by the frequency of its rarest allele, so
that polymorphisms with low frequency alleles are particularly useful as markers of low
frequency haplotypes. As particular mutations or polymorphisms associated with certain
clinical features, such as adverse or abnormal events, are likely to be of low frequency within
the population, low frequency polymorphisms may be particularly useful in identifying these
mutations (for examples see: De Stefano V et al., Ann Hum Genet (1998) 62:481-90; and
Keightley AM et al., Blood (1999) 93:4277-83).
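
The upper limit on the number of haplotypes can be illustrated by enumeration. The
following Python sketch is illustrative only; the function names are hypothetical. It uses
three of the biallelic positions disclosed herein to show the enumeration, and computes the
theoretical maximum for fourteen biallelic positions. It does not take account of linkage
disequilibrium, which in practice would be expected to reduce the number of haplotypes
actually observed.

```python
from itertools import product


def max_haplotypes(n_sites, alleles_per_site=2):
    """Theoretical maximum number of haplotypes for n polymorphic sites."""
    return alleles_per_site ** n_sites


def enumerate_haplotypes(sites):
    """List every combination of alleles across the given sites.

    `sites` is a list of (position, alleles) pairs; each haplotype is
    returned as a dict mapping position to allele.
    """
    positions = [position for position, _ in sites]
    allele_sets = [alleles for _, alleles in sites]
    return [dict(zip(positions, combo)) for combo in product(*allele_sets)]


# Three of the disclosed biallelic positions and their alleles.
sites = [(344, ("G", "A")), (908, ("C", "T")), (1136, ("G", "C"))]
haplotypes = enumerate_haplotypes(sites)

print(len(haplotypes))       # 2^3 combinations
print(haplotypes[0])
print(max_haplotypes(14))    # 2^14 for fourteen biallelic positions
```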

In a further aspect, the diagnostic methods of the invention are used in the
development of new drug therapies which selectively target one or more allelic variants of the
EP1-R gene. Identification of a link between a particular allelic variant and predisposition to
disease development or response to drug therapy may have a significant impact on the design
of new drugs. Drugs may be designed to regulate the biological activity of variants implicated
in the disease process whilst minimising effects on other variants.

In a further diagnostic aspect of the invention the presence or absence of variant 
nucleotides is detected by reference to the loss or gain of, optionally engineered, sites 
recognised by restriction enzymes (see Example 3). 
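
The loss or gain of a restriction site can be illustrated as follows. This Python sketch is
illustrative only. The flanking sequences are hypothetical and are not taken from SEQ ID
NO. 1, and the choice of the EcoRI recognition sequence (GAATTC) is an assumption made for
the purpose of the example rather than the enzyme used in the Examples. Because the site is
palindromic, only one strand is searched.

```python
ECORI_SITE = "GAATTC"  # EcoRI recognition sequence (illustrative choice)


def has_site(sequence, recognition_site=ECORI_SITE):
    """Return True if the recognition site occurs in the sequence."""
    return recognition_site in sequence.upper()


# Hypothetical flanking sequence around a T/C polymorphic position.
FLANK_5 = "TTGAAT"
FLANK_3 = "CAGG"

for base in ("T", "C"):
    allele = FLANK_5 + base + FLANK_3
    print(base, allele, has_site(allele))
```

In this hypothetical case the T allele creates the site and would be cut by the enzyme,
whereas the C allele lacks the site and would remain uncut, so the two alleles give
restriction fragments of different lengths.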

According to another aspect of the present invention there is provided a nucleic acid
comprising any one of the following polymorphisms:

the nucleic acid of SEQ ID NO. 1 with A at position 344 as defined by the positions in SEQ ID
NO. 1;

the nucleic acid of SEQ ID NO. 1 with six Gs at positions 621-626 as defined by the positions
in SEQ ID NO. 1;

the nucleic acid of SEQ ID NO. 1 with six Gs at positions 793-798 as defined by the positions
in SEQ ID NO. 1;

the nucleic acid of SEQ ID NO. 1 with T at position 908 as defined by the positions in SEQ ID
NO. 1;

the nucleic acid of SEQ ID NO. 1 with C at position 1136 as defined by the positions in SEQ
ID NO. 1;

the nucleic acid of SEQ ID NO. 1 with C at position 1160 as defined by the positions in SEQ
ID NO. 1;

the nucleic acid of SEQ ID NO. 1 with A at position 1189 as defined by the positions in SEQ
ID NO. 1;

the nucleic acid of SEQ ID NO. 1 with G at position 1458 as defined by the positions in SEQ
ID NO. 1;

the nucleic acid of SEQ ID NO. 1 with G at position 1656 as defined by the positions in SEQ
ID NO. 1;

the nucleic acid of SEQ ID NO. 1 with C at position 2448 as defined by the positions in SEQ
ID NO. 1;

the nucleic acid of SEQ ID NO. 1 with A at position 2531 as defined by the positions in SEQ
ID NO. 1;

the nucleic acid of SEQ ID NO. 1 with T at position 3348 as defined by the positions in SEQ
ID NO. 1;

the nucleic acid of SEQ ID NO. 1 with G at position 3432 as defined by the positions in SEQ
ID NO. 1;

the nucleic acid of SEQ ID NO. 1 with A at position 3622 as defined by the positions in SEQ
ID NO. 1;

or a complementary strand thereof or an antisense sequence thereto or a fragment thereof of at
least 17 bases comprising at least one polymorphism.

Fragments are at least 17 bases, more preferably at least 20 bases, more preferably at 
least 30 bases. 

Novel sequence disclosed herein may be used in another embodiment of the 
invention to regulate expression of the gene in cells by the use of antisense constructs. To 
enable methods of down-regulating expression of the gene of the present invention in 
mammalian cells, an example antisense expression construct can be readily constructed, for 
instance using the pREP10 vector (Invitrogen Corporation). Transcripts are expected to 
inhibit translation of the gene in cells transfected with this type of construct. Antisense 
transcripts are effective for inhibiting translation of the native gene transcript, and capable of 
inducing the effects (e.g., regulation of tissue physiology) herein described. Oligonucleotides 
which are complementary to and hybridisable with any portion of novel gene mRNA 
disclosed herein are contemplated for therapeutic use. U.S. Patent No. 5,639,595, 
"Identification of Novel Drugs and Reagents", issued Jun. 17, 1997, wherein methods of 
identifying oligonucleotide sequences that display in vivo activity are thoroughly described, 
is herein incorporated by reference. Expression vectors containing random oligonucleotide 
sequences derived from previously known polynucleotides are transformed into cells. The 
cells are then assayed for a phenotype resulting from the desired activity of the 
oligonucleotide. Once cells with the desired phenotype have been identified, the sequence of 
the oligonucleotide having the desired activity can be identified. Identification may be 
accomplished by recovering the vector or by polymerase chain reaction (PCR) amplification 
and sequencing the region containing the inserted nucleic acid material. Antisense molecules 
can be synthesised for antisense therapy. These antisense molecules may be DNA, stable 
derivatives of DNA such as phosphorothioates or methylphosphonates, RNA, stable 
derivatives of RNA such as 2'-O-alkylRNA, or other oligonucleotide mimetics. U.S. Patent 
No. 5,652,355, "Hybrid Oligonucleotide Phosphorothioates", issued July 29, 1997, and U.S. 
Patent No. 5,652,356, "Inverted Chimeric and Hybrid Oligonucleotides", issued July 29, 
1997, which describe the synthesis and effect of physiologically-stable antisense molecules, 
are incorporated by reference. Antisense molecules may be introduced into cells by 
microinjection, liposome encapsulation or by expression from vectors harboring the antisense 
sequence. 

The invention further provides nucleotide primers which can detect the 
polymorphisms of the invention. 

According to another aspect of the present invention there is provided an allele 
specific primer capable of detecting an EP1-R gene polymorphism at one or more of positions 
344, 621-627, 793-799, 908, 1136, 1160, 1189, 1458, 1656, 2448, 2531, 3348, 3432, and 
3622 in the EP1-R gene as defined by the positions in SEQ ID NO. 1. 

An allele specific primer is used, generally together with a constant primer, in an 
amplification reaction such as a PCR reaction, which provides the discrimination between 
alleles through selective amplification of one allele at a particular sequence position, e.g. as 
used for ARMS™-allele specific amplification assays. The allele specific primer is preferably 
17-50 nucleotides, more preferably about 17-35 nucleotides, more preferably about 17-30 
nucleotides. 

An allele specific primer preferably corresponds exactly with the allele to be detected 
but derivatives thereof are also contemplated wherein about 6-8 of the nucleotides at the 3' 
terminus correspond with the allele to be detected and wherein up to 10, such as up to 8, 6, 4, 
2, or 1 of the remaining nucleotides may be varied without significantly affecting the 
properties of the primer. 

Primers may be manufactured using any convenient method of synthesis. Examples of 
such methods may be found in standard textbooks, for example "Protocols for 
Oligonucleotides and Analogues; Synthesis and Properties," Methods in Molecular Biology 
Series; Volume 20; Ed. Sudhir Agrawal, Humana ISBN: 0-89603-247-7; 1993; 1st Edition. If 
required the primer(s) may be labelled to facilitate detection. 

According to another aspect of the present invention there is provided an 
allele-specific oligonucleotide probe capable of detecting an EP1-R gene polymorphism at one or 
more of positions 344, 621-627, 793-799, 908, 1136, 1160, 1189, 1458, 1656, 2448, 2531, 
3348, 3432 and 3622 in the EP1-R gene as defined by the positions in SEQ ID NO. 1. 

The allele-specific oligonucleotide probe is preferably 17-50 nucleotides, more 
preferably about 17-35 nucleotides, more preferably about 17-30 nucleotides. 

The design of such probes will be apparent to the molecular biologist of ordinary skill. 
Such probes are of any convenient length such as up to 50 bases, up to 40 bases, more 
conveniently up to 30 bases in length, such as for example 8-25 or 8-15 bases in length. In 
general such probes will comprise base sequences entirely complementary to the 
corresponding wild type or variant locus in the gene. However, if required one or more 
mismatches may be introduced, provided that the discriminatory power of the oligonucleotide 
probe is not unduly affected. The probes of the invention may carry one or more labels to 
facilitate detection. 




Oligonucleotide probes and primers generally differ according to the location of the 
base capable of hybridising with the polymorphic base. Thus, with a probe, the nucleotide 
capable of complementing the polymorphism is located in a centralised position, whereas with 
a primer (to be used in an amplification reaction) the nucleotide complementing the 
polymorphism is preferably located at the 3' end of the primer. 
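
Purely by way of illustration, the positional difference between a probe and a primer may be sketched in Python as follows. The template sequence, the index of the polymorphic base, the function names and the default lengths (20 and 21 bases, chosen to fall within the 17-50 nucleotide range given above) are hypothetical and are not taken from SEQ ID NO. 1. The sketch builds same-strand oligonucleotides only and does not model mismatch tolerance or labelling.

```python
# Illustrative sketch only: TEMPLATE is a made-up sequence, not part of SEQ ID NO. 1.
TEMPLATE = "ATGCGTACCTGAAGCTTGGACCATGCTAGGCTTACGATCGGATCCAGTTCA"


def allele_specific_primer(template, snp_index, allele, length=20):
    """Return a same-strand primer whose 3' terminal base is the polymorphic base."""
    start = snp_index - length + 1
    if start < 0:
        raise ValueError("not enough upstream sequence for this primer length")
    return template[start:snp_index] + allele


def allele_specific_probe(template, snp_index, allele, length=21):
    """Return a same-strand probe with the polymorphic base at its central position."""
    if length % 2 == 0:
        raise ValueError("use an odd length so that one base is exactly central")
    half = length // 2
    start, end = snp_index - half, snp_index + half + 1
    if start < 0 or end > len(template):
        raise ValueError("not enough flanking sequence for this probe length")
    return template[start:snp_index] + allele + template[snp_index + 1:end]


SNP_INDEX = 25  # hypothetical 0-based index of the polymorphic base
primer = allele_specific_primer(TEMPLATE, SNP_INDEX, "A")
probe = allele_specific_probe(TEMPLATE, SNP_INDEX, "A")
print(primer, len(primer))
print(probe, len(probe))
```

In this sketch the allele-specific base is the last character of the primer and the middle character of the probe, which mirrors the distinction drawn in the preceding paragraph.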

According to another aspect of the present invention there is provided a diagnostic kit 
comprising an allele-specific oligonucleotide probe of the invention and/or an allele-specific 
primer of the invention. 

The diagnostic kits may comprise appropriate packaging and instructions for use in the 
methods of the invention. Such kits may further comprise appropriate buffer(s), nucleotides, 
and polymerase(s) such as thermostable polymerases, for example Taq polymerase. 

In another aspect of the invention, the polymorphisms of this invention may be used as 
genetic markers in linkage studies. This particularly applies to the polymorphisms at 
positions 621-627, 1136, 1189, 1458, 3348 and 3622 in the EP1-R gene as defined by the 
positions in SEQ ID NO. 1 because of their relatively high frequency (see Example 1). 

According to another aspect of the present invention there is provided a method of 
treating a human in need of treatment with an EP1-R drug in which the method comprises: 

i) diagnosis of a polymorphism in the EP1-R gene in the human, which diagnosis 
comprises determining the sequence of the nucleic acid at one or more of positions 344, 
621-627, 793-799, 908, 1136, 1160, 1189, 1458, 1656, 2448, 2531, 3348, 3432 and 3622 in the 
EP1-R gene as defined by the positions in SEQ ID NO. 1, and determining the status of the 
human by reference to polymorphism in the EP1-R gene; and 

ii) administering an effective amount of an EP1-R drug. 

Preferably determination of the status of the human is clinically useful. Examples of 
clinical usefulness include deciding which drug or drugs to administer and/or establishing the 
effective amount of the drug or drugs. 

Drugs which decrease the activity of EP1-R are of value in a number of disease 
conditions, including disease states associated with pain such as rheumatoid arthritis, 
osteoarthritis and osteoporosis. 

According to another aspect of the present invention there is provided use of an EP1-R 
drug in the preparation of a medicament for treating an EP1-R-mediated disease in a human 
diagnosed as having a polymorphism at one or more of positions 344, 621-627, 793-799, 908, 
1136, 1160, 1189, 1458, 1656, 2448, 2531, 3348, 3432 and 3622 in the EP1-R gene as 
defined by the positions in SEQ ID NO. 1. 

According to another aspect of the present invention there is provided a 
pharmaceutical pack comprising an EP1-R drug and instructions for administration of the 
drug to humans diagnostically tested for a polymorphism at one or more of positions 344, 
621-627, 793-799, 908, 1136, 1160, 1189, 1458, 1656, 2448, 2531, 3348, 3432 and 3622 in 
the EP1-R gene as defined by the positions in SEQ ID NO. 1. 

According to another aspect of the present invention there is provided a computer 
readable medium comprising at least one novel polynucleotide sequence of the invention 
stored on the medium. The computer readable medium may be used, for example, in 
homology searching, mapping, haplotyping, genotyping or pharmacogenetic analysis or any 
other bioinformatic analysis. The reader is referred to Bioinformatics, A practical guide to the 
analysis of genes and proteins, edited by A D Baxevanis & B F F Ouellette, John Wiley & 
Sons, 1998. Any computer readable medium may be used, for example, floppy disks, tapes, 
chips, compact disks, digital disks, video disks, punch cards and hard drives. 

The polynucleotide sequences of the invention, or parts thereof, particularly those 
relating to and identifying the polymorphisms identified herein, represent a valuable 
information source, for example, to characterise individuals in terms of haplotype and other 
sub-groupings, such as investigation of susceptibility to treatment with particular drugs. 

These approaches are most easily facilitated by storing the sequence information in a 
computer readable medium and then using the information in standard bioinformatics 
programs or to search sequence databases using state of the art searching tools such as "GCG" 
(Genetics Computer Group), BlastX, BlastP, BlastN, FASTA (refer to Altschul et al. (1990) J. 
Mol. Biol. 215:403-410). Thus, the polynucleotide sequences of the invention are particularly 
useful as components in databases useful for sequence identity and other search analyses. As 
used herein, storage of the sequence information in a computer readable medium and use in 
sequence databases in relation to 'polynucleotide or polynucleotide sequence of the invention' 
covers any detectable chemical or physical characteristic of a polynucleotide of the invention 
that may be reduced to, converted into or stored in a tangible medium, such as a computer 
disk, preferably in a computer readable form. Examples include chromatographic scan data or 
peak data, photographic scan or peak data, mass spectrographic data, and sequence gel (or other) 
data. 

The invention provides a computer readable medium having stored thereon one or 
more polynucleotide sequences of the invention. For example, a computer readable medium 
is provided comprising and having stored thereon a member selected from the group 
consisting of: a polynucleotide comprising the sequence of a polynucleotide of the invention, 
a polynucleotide consisting of a polynucleotide of the invention, a polynucleotide which 
comprises part of a polynucleotide of the invention, which part includes at least one of the 
polymorphisms of the invention, a set of polynucleotide sequences wherein the set includes at 
least one polynucleotide sequence of the invention, a data set comprising or consisting of a 
polynucleotide sequence of the invention or a part thereof comprising at least one of the 
polymorphisms identified herein. 

Thus, according to another aspect of the invention there is provided a computer 
readable medium having stored thereon a nucleic acid sequence comprising at least 17, 
preferably at least 20 consecutive bases of the EP1-R gene sequence, which sequence includes 
at least one of the polymorphisms at positions: 344, 621-627, 793-799, 908, 1136, 1160, 1189, 
1458, 1656, 2448, 2531, 3348, 3432 and 3622 in the EP1-R gene as defined by the positions 
in SEQ ID NO. 1. 

A computer based method is also provided for performing sequence identification, 
said method comprising the steps of providing a polynucleotide sequence comprising a 
polymorphism of the invention in a computer readable medium; and comparing said 
polymorphism containing polynucleotide sequence to at least one other polynucleotide or 
polypeptide sequence to identify identity (homology), i.e. screen for the presence of a 
polymorphism. 

In another aspect of the invention there is provided a method for performing sequence 
identification, said method comprising the steps of providing a nucleic acid sequence 
comprising at least 20 consecutive bases of the EP1-R gene sequence, which sequence 
includes at least one of the polymorphisms at positions: 344, 621-627, 793-799, 908, 1136, 
1160, 1189, 1458, 1656, 2448, 2531, 3348, 3432 and 3622 in the EP1-R gene as defined by 
the positions in SEQ ID NO. 1, in a computer readable medium; and comparing said nucleic 
acid sequence to at least one other nucleic acid sequence to identify identity. 
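
As a minimal sketch of such a comparison, and not a description of any particular searching tool, a stored fragment of at least 20 bases may be compared with another sequence by exact matching on both strands. The two 20-base fragments below are invented for illustration and are not derived from SEQ ID NO. 1; the function names and the `min_length` parameter are likewise assumptions.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def find_fragment(fragment, target, min_length=20):
    """Return (strand, 0-based offset) for each exact occurrence of fragment in target."""
    if len(fragment) < min_length:
        raise ValueError("fragment is shorter than the required minimum length")
    hits = []
    for strand, query in (("+", fragment), ("-", reverse_complement(fragment))):
        offset = target.find(query)
        while offset != -1:
            hits.append((strand, offset))
            offset = target.find(query, offset + 1)
    return hits


# Hypothetical 20-base fragments that differ only at index 10 (G versus A).
VARIANT_FRAGMENT = "ACGTACGTTAGCATCGGATC"
REFERENCE_FRAGMENT = "ACGTACGTTAACATCGGATC"

sample_with_variant = "TTTT" + VARIANT_FRAGMENT + "GGGG"
sample_with_reference = "TTTT" + REFERENCE_FRAGMENT + "GGGG"

print(find_fragment(VARIANT_FRAGMENT, sample_with_variant))
print(find_fragment(VARIANT_FRAGMENT, sample_with_reference))
```

Exact matching is used here only because it makes the presence or absence of the polymorphic base unambiguous; the alignment programs cited above tolerate mismatches and would need their output inspected at the polymorphic position.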

Two of the polymorphisms of the present invention result in variations in amino acid 
sequence in the translated protein. Polymorphism at position 2448 as defined in SEQ ID 
NO. 1 results in an amino acid change from leucine to proline at corresponding position 126 of 
the translated protein (Leu126Pro). Polymorphism at position 2531 as defined in SEQ ID 
NO. 1 results in an amino acid change from alanine to threonine at corresponding position 154 
of the translated protein (Ala154Thr). 

Thus according to another aspect of the present invention there is provided an allelic 
variant of the human EP1-R polypeptide having a proline at position 126 and/or a threonine at 
position 154 or a fragment thereof comprising at least 10 amino acids provided that the 
fragment comprises the allelic variant at position 126 and/or position 154. Preferably the 
allelic variant is at least 30% pure, more preferably at least 60% pure, more preferably at least 
90% pure, more preferably at least 95% pure, and more preferably at least 99% pure. 

Fragments of EP1-R polypeptide are at least 10 amino acids, more preferably at least 
15 amino acids, more preferably at least 20 amino acids. The polypeptides of the invention do 
not encompass naturally occurring polypeptides as they occur in nature, for example, the 
polypeptide is at least partially purified from at least one component with which it occurs 
naturally. Preferably the polypeptide is at least 30% pure, more preferably at least 60% pure, 
more preferably at least 90% pure, more preferably at least 95% pure, and more preferably at 
least 99% pure. 

According to another aspect of the present invention there is provided an antibody 
specific for an allelic variant of human EP1-R polypeptide having a proline at position 126 
and/or a threonine at position 154 or a fragment thereof comprising at least 10 amino acids 
provided that the fragment comprises the allelic variants at position 126 and/or position 154. 

Antibodies can be prepared using any suitable method. For example, purified 
polypeptide may be utilised to prepare specific antibodies. The term "antibodies" includes 
polyclonal antibodies, monoclonal antibodies, and the various types of antibody constructs 
such as for example F(ab')2, Fab and single chain Fv. Antibodies are defined to be 
specifically binding if they bind the antigen with a Ka of greater than or equal to about 10^7 
M^-1. Affinity of binding can be determined using conventional techniques, for example those 
described by Scatchard et al., Ann. N.Y. Acad. Sci., 51:660 (1949). 

Polyclonal antibodies can be readily generated from a variety of sources, for example, 
horses, cows, goats, sheep, dogs, chickens, rabbits, mice or rats, using procedures that are 
well-known in the art. In general, antigen is administered to the host animal typically through 
parenteral injection. The immunogenicity of antigen may be enhanced through the use of an 
adjuvant, for example, Freund's complete or incomplete adjuvant. Following booster 
immunisations, small samples of serum are collected and tested for reactivity to antigen. 
Examples of various assays useful for such determination include those described in: 
Antibodies: A Laboratory Manual, Harlow and Lane (eds.), Cold Spring Harbor Laboratory 
Press, 1988; as well as procedures such as countercurrent immuno-electrophoresis (CIEP), 
radioimmunoassay, radioimmunoprecipitation, enzyme-linked immuno-sorbent assays 
(ELISA), dot blot assays, and sandwich assays, see U.S. Patent Nos. 4,376,110 and 4,486,530. 

Monoclonal antibodies may be readily prepared using well-known procedures, see for 
example, the procedures described in U.S. Patent Nos. RE 32,011, 4,902,614, 4,543,439 and 
4,411,993; Monoclonal Antibodies, Hybridomas: A New Dimension in Biological Analyses, 
Plenum Press, Kennett, McKearn, and Bechtol (eds.), (1980). 

The monoclonal antibodies of the invention can be produced using alternative 
techniques, such as those described by Alting-Mees et al., "Monoclonal Antibody Expression 
Libraries: A Rapid Alternative to Hybridomas", Strategies in Molecular Biology 3: 1-9 (1990) 
which is incorporated herein by reference. Similarly, binding partners can be constructed 
using recombinant DNA techniques to incorporate the variable regions of a gene that encodes 
a specific binding antibody. Such a technique is described in Larrick et al., Biotechnology, 7: 
394 (1989). 

Once isolated and purified, the antibodies may be used to detect the presence of 
antigen in a sample using established assay protocols. 

The invention will now be illustrated but not limited by reference to the following 
Examples and Figure. All temperatures are in degrees Celsius. 

In the Examples below, unless otherwise stated, the following methodology and 
materials have been applied. 

AMPLITAQ™ or AMPLITAQ GOLD™, available from Perkin-Elmer Cetus, are used 
as the source of thermostable DNA polymerase. 

General molecular biology procedures can be followed from any of the methods 
described in "Molecular Cloning - A Laboratory Manual" Second Edition, Sambrook, Fritsch 
and Maniatis (Cold Spring Harbor Laboratory, 1989) or "Current Protocols in Molecular 
Biology", Volumes 1-3, edited by F M Ausubel, R Brent, R E Kingston, pub. John Wiley, 1998. 

Electropherograms were obtained in a standard manner: data was collected by ABI 377 
data collection software and the wave form generated by ABI Prism™ sequencing analysis 
(2.1.2). 

Figure 1 

Genomic DNA sequence of the human EPl-R gene. 

Example 1 

Identification of Polymorphisms 
1. Methods 

Genomic DNA Preparation 

Genomic DNA was prepared from lymphoblastoid cell lines from Caucasian donors 
following protocol I (Molecular Cloning: A Laboratory Manual, p392, Sambrook, Fritsch and 
Maniatis, 2nd Edition, Cold Spring Harbor Press, 1989) with the following modifications. 
Samples were extracted with phenol, then phenol/chloroform and then chloroform rather than 
with three phenol extractions. The DNA was dissolved in deionised water. 

Template Preparation 

Templates were prepared by PCR. The extension temperature was 72° and 
denaturation temperature 94°; each step was 1 minute. Generally 50 ng genomic DNA was 
used in each reaction and subjected to 40 cycles of PCR. 

For dye-primer sequencing the forward primers were modified to include M13 forward 
sequence (ABI protocol P/N 402114, Applied Biosystems) at the 5' end of the 
oligonucleotides. 

Dye Primer Sequencing 

Dye-primer sequencing using M13 forward primer was as described in the ABI 
protocol P/N 402114 for the ABI Prism™ dye primer cycle sequencing core kit with 
"AmpliTaq FS"™ DNA polymerase, modified in that the annealing temperature was 45° and 
DMSO was added to the cycle sequencing mix to a final concentration of 5%. 

The extension reactions for each base were pooled, ethanol/sodium acetate 
precipitated, washed and resuspended in formamide loading buffer. 

4.25% Acrylamide gels were run on an automated sequencer (ABI 377, Applied 
Biosystems). 

2. Results 
Novel Polymorphisms 



Position  Reference     Region    Reference  Second  Effect               RFLP           Second Allele
                                  Allele     Allele                                      Frequency

344       SEQ ID NO. 1            G          A       loss of SP1 site                    3/54
621-627   SEQ ID NO. 1            7xG        6xG                                         32/54
793-799   SEQ ID NO. 1            7xG        6xG                                         5/52
908       SEQ ID NO. 1            C          T       loss of SP1 site                    1/46
1136      SEQ ID NO. 1  intron 1  G          C                                           11/34
1160      SEQ ID NO. 1  intron 1  T          C                            + eng Bfa I    2/40
1189      SEQ ID NO. 1  intron 1  G          A                                           12/40
1458      SEQ ID NO. 1  intron 1  A          G                            + eng Apa I    19/38
1656      SEQ ID NO. 1  intron 1  T          G                            - eng Msc I    2/40
2448      SEQ ID NO. 1  exon 2    T          C       Leu126Pro            + eng Sac II   2/52
2531      SEQ ID NO. 1  exon 2    G          A       Ala154Thr            - Bss HI       1/52
3348      SEQ ID NO. 1  intron 2  C          T                                           17/50
3432      SEQ ID NO. 1  intron 2  C          G                            + Nla III      7/40
3622      SEQ ID NO. 1  exon 3    G          A       silent polymorphism  + eng Spe I    12/38




Frequency is the allele frequency of the second allele in control subjects. 
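
For illustration, the second allele frequencies tabulated above can be converted to decimal fractions and ranked. The counts are those of the table; the cut-off of 0.25 and the function names are assumptions introduced here only for the sketch and are not stated in the specification.

```python
# Second allele counts (observed, total alleles typed) copied from the table above.
COUNTS = {
    "344": (3, 54), "621-627": (32, 54), "793-799": (5, 52), "908": (1, 46),
    "1136": (11, 34), "1160": (2, 40), "1189": (12, 40), "1458": (19, 38),
    "1656": (2, 40), "2448": (2, 52), "2531": (1, 52), "3348": (17, 50),
    "3432": (7, 40), "3622": (12, 38),
}


def second_allele_frequencies(counts):
    """Convert (observed, total) pairs into decimal allele frequencies."""
    return {pos: observed / total for pos, (observed, total) in counts.items()}


def frequent_markers(counts, threshold=0.25):
    """List positions whose second allele frequency is at or above threshold.

    The threshold is an illustrative assumption, not a value from the text.
    """
    freqs = second_allele_frequencies(counts)
    return [pos for pos, freq in freqs.items() if freq >= threshold]


for pos, freq in second_allele_frequencies(COUNTS).items():
    print(f"{pos:>8}  {freq:.3f}")
print(frequent_markers(COUNTS))
```

With this assumed cut-off the positions selected are 621-627, 1136, 1189, 1458, 3348 and 3622, which are the polymorphisms singled out in the description as being of relatively high frequency.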



Example 2 

Determination of the full length genomic DNA sequence of the human EP1-R gene 

The full length genomic DNA sequence of the human EP1-R gene was obtained by 
PCR. Intron sequences were obtained by PCR from the flanking cDNA sequence. The 
genomic sequence 1-1073 was obtained by vectorette PCR (Riley et al, Nuc. Acid Res. 18, 
2887-2890, 1990). 



Comparison of corresponding regions of sequence in SEQ ID NO. 1 and EMBL L22647 



SEQ ID NO. 1       EMBL L22647 
position number    position number 

1074-1130          1-57 
2055-3013          58-1016 
3566-3908          1017-1359 
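
The correspondence tabulated above amounts to three co-linear blocks, so a position in SEQ ID NO. 1 can be converted to the corresponding L22647 position by a simple offset. The following Python sketch assumes, as the matching block lengths suggest, that each block maps base for base; the function name is hypothetical.

```python
# (start in SEQ ID NO. 1, end in SEQ ID NO. 1, start in EMBL L22647), from the table above.
SEGMENTS = [
    (1074, 1130, 1),
    (2055, 3013, 58),
    (3566, 3908, 1017),
]


def genomic_to_cdna(position):
    """Map a SEQ ID NO. 1 position to its L22647 position, or None if outside all blocks."""
    for genomic_start, genomic_end, cdna_start in SEGMENTS:
        if genomic_start <= position <= genomic_end:
            return cdna_start + (position - genomic_start)
    return None


for position in (1136, 2448, 2531, 3622):
    print(position, genomic_to_cdna(position))
```

Positions that fall between the blocks, such as 1136, return None in this sketch because they have no counterpart in the cDNA entry.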



Identification of errors in published EP1-R gene sequences 

By carrying out a detailed comparison of the full length genomic DNA sequence of the EP1-R 
gene as defined in SEQ ID NO. 1 with the cDNA sequence published in EMBL L22647 and 
the genomic DNA sequences published in EMBL AC008569, we have identified the 
following errors in the published sequences, some of which result in changes to the published 
amino acid sequence. 

Position as    Published    Sequence as      Codon         Amino acid
defined in     sequence     determined in    change        change
L22647         L22647       SEQ ID NO. 1

285            A            G                ACC to GCC    Thr71Ala
763            A            T                CAT to CTA    His230Leu
764            T            A
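
The codon and amino acid changes listed above can be checked against the standard genetic code. The sketch below builds the standard codon table in the usual TCAG order and translates the published and re-determined codons; the function name and the restricted three-letter lookup are illustrative choices and not part of the specification.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Only the residues needed for the two corrections are listed here.
THREE_LETTER = {"A": "Ala", "H": "His", "L": "Leu", "T": "Thr"}


def describe_change(published_codon, determined_codon, residue_number):
    """Return a label such as 'Thr71Ala' for a published-to-determined codon change."""
    before = THREE_LETTER[CODON_TABLE[published_codon]]
    after = THREE_LETTER[CODON_TABLE[determined_codon]]
    return f"{before}{residue_number}{after}"


print(describe_change("ACC", "GCC", 71))
print(describe_change("CAT", "CTA", 230))
```

Both labels produced by this sketch agree with the amino acid changes given in the table.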







Position as defined    Published sequence    Sequence as determined
in SEQ ID NO. 1        AC008569              in SEQ ID NO. 1

2912                   AA                    A
1088                   T                     C
2264                   A                     C
2275                                         C
2287                   A                     C
2699                   A                     C
2860                   A                     C
2985                   A                     T
3060                   T                     C
3062                   G                     C
3729                   A                     G


Example 3 

Engineered RFLPs 



Position    Diagnostic fragment    Forward Primer    Reverse Primer

1160        1082-1182              1082-1102         1161-1182 Bfa I
1458        1082-1477              1082-1102         1459-1477 Apa I
1656        1434-1682              1434-1453         1657-1682 Msc I
2448        2351-2472              2351-2371         2449-2472 Sac II
3622        3372-3646              3372-3395         3623-3646 Spe I

All positions refer to the positions in SEQ ID NO. 1. 

Primer 1161-1182 Bfa I 
CTTCATGCCCTCCTCCTCCCTA  SEQ ID NO. 2 
C at position 1160 creates a Bfa I site in the diagnostic fragment. 

Primer 1459-1477 Apa I 
CCTGCCCCATGGACGGGCC  SEQ ID NO. 3 
G at position 1458 creates an Apa I site in the diagnostic fragment. 

Primer 1657-1682 Msc I 
TCTGATAGCTCTCACCCATTTTGGCC  SEQ ID NO. 4 
T at position 1656 creates a Msc I site in the diagnostic fragment. 

Primer 2449-2472 Sac II 
TCCACGGCCATGCCACAGCCCCGC  SEQ ID NO. 5 
C at position 2448 creates a Sac II site in the diagnostic fragment. 

Primer 3623-3646 Spe I 
GGAGGCAAGGCGCACGGCCAGGAC  SEQ ID NO. 6 
A at position 3622 creates a Spe I site in the diagnostic fragment. 
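
The engineered sites can be checked in outline from the primer sequences alone. The sketch below assumes that each reverse primer is written 5' to 3' on the strand complementary to SEQ ID NO. 1, so that the first base added to its 3' end is the complement of the base at the polymorphic position; this orientation is inferred from the position ranges and is not stated explicitly. The recognition sequences used (Bfa I CTAG, Apa I GGGCCC, Msc I TGGCCA, Sac II CCGCGG, Spe I ACTAGT) are the standard ones for these enzymes. The function name is hypothetical.

```python
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}

# (position, reverse primer 5'->3', enzyme, recognition site,
#  allele stated to create the site, alternative allele from Example 1)
ASSAYS = [
    (1160, "CTTCATGCCCTCCTCCTCCCTA", "Bfa I", "CTAG", "C", "T"),
    (1458, "CCTGCCCCATGGACGGGCC", "Apa I", "GGGCCC", "G", "A"),
    (1656, "TCTGATAGCTCTCACCCATTTTGGCC", "Msc I", "TGGCCA", "T", "G"),
    (2448, "TCCACGGCCATGCCACAGCCCCGC", "Sac II", "CCGCGG", "C", "T"),
    (3622, "GGAGGCAAGGCGCACGGCCAGGAC", "Spe I", "ACTAGT", "A", "G"),
]


def site_overlap(primer, allele, site):
    """Length of the longest prefix of site that ends exactly at the polymorphic base.

    A value equal to len(site) means primer plus allele base complete the site;
    a shorter non-zero value means the remaining bases must come from the template.
    """
    extended = primer + COMPLEMENT[allele]
    for length in range(len(site), 0, -1):
        if extended.endswith(site[:length]):
            return length
    return 0


results = {
    position: (site_overlap(primer, creating, site), site_overlap(primer, other, site))
    for position, primer, enzyme, site, creating, other in ASSAYS
}
for position, (with_site_allele, with_other_allele) in results.items():
    print(position, with_site_allele, with_other_allele)
```

Under these assumptions the Bfa I, Apa I and Msc I sites are completed by the primer and the allele base alone, whereas the Sac II and Spe I sites are only partly covered (five of six and three of six bases respectively) and depend on adjoining template sequence that is not reproduced in this Example. In every case the alternative allele gives no overlap with the recognition sequence.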



